skip to main content


Search for: All records

Creators/Authors contains: "Aylward, Frank O."

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Abstract

    Viruses of the phylumNucleocytoviricota, often referred to as “giant viruses,” are prevalent in various environments around the globe and play significant roles in shaping eukaryotic diversity and activities in global ecosystems. Given the extensive phylogenetic diversity within this viral group and the highly complex composition of their genomes, taxonomic classification of giant viruses, particularly incomplete metagenome-assembled genomes (MAGs) can present a considerable challenge. Here we developed TIGTOG (TaxonomicInformation ofGiant viruses usingTrademarkOrthologousGroups), a machine learning-based approach to predict the taxonomic classification of novel giant virus MAGs based on profiles of protein family content. We applied a random forest algorithm to a training set of 1531 quality-checked, phylogenetically diverseNucleocytoviricotagenomes using pre-selected sets of giant virus orthologous groups (GVOGs). The classification models were predictive of viral taxonomic assignments with a cross-validation accuracy of 99.6% at the order level and 97.3% at the family level. We found that no individual GVOGs or genome features significantly influenced the algorithm’s performance or the models’ predictions, indicating that classification predictions were based on a comprehensive genomic signature, which reduced the necessity of a fixed set of marker genes for taxonomic assigning purposes. Our classification models were validated with an independent test set of 823 giant virus genomes with varied genomic completeness and taxonomy and demonstrated an accuracy of 98.6% and 95.9% at the order and family level, respectively. Our results indicate that protein family profiles can be used to accurately classify large DNA viruses at different taxonomic levels and provide a fast and accurate method for the classification of giant viruses. This approach could easily be adapted to other viral groups.

     
    more » « less
  2. Abstract

    Viruses of the phylum Nucleocytoviricota are ubiquitous in ocean waters and play important roles in shaping the dynamics of marine ecosystems. In this study, we leveraged the bioGEOTRACES metagenomic dataset collected across the Atlantic and Pacific Oceans to investigate the biogeography of these viruses in marine environments. We identified 330 viral genomes, including 212 in the order Imitervirales and 54 in the order Algavirales. We found that most viruses appeared to be prevalent in shallow waters (<150 m), and that viruses of the Mesomimiviridae (Imitervirales) and Prasinoviridae (Algavirales) are by far the most abundant and diverse groups in our survey. Five mesomimiviruses and one prasinovirus are particularly widespread in oligotrophic waters; annotation of these genomes revealed common stress response systems, photosynthesis-associated genes, and oxidative stress modulation genes that may be key to their broad distribution in the pelagic ocean. We identified a latitudinal pattern in viral diversity in one cruise that traversed the North and South Atlantic Ocean, with viral diversity peaking at high latitudes of the northern hemisphere. Community analyses revealed three distinct Nucleocytoviricota communities across latitudes, categorized by latitudinal distance towards the equator. Our results contribute to the understanding of the biogeography of these viruses in marine systems.

     
    more » « less
  3. Abstract

    The phylum Nucleocytoviricota includes the largest and most complex viruses known. These “giant viruses” have a long evolutionary history that dates back to the early diversification of eukaryotes, and over time they have evolved elaborate strategies for manipulating the physiology of their hosts during infection. One of the most captivating of these mechanisms involves the use of genes acquired from the host—referred to here as viral homologs or “virologs”—as a means of promoting viral propagation. The best-known examples of these are involved in mimicry, in which viral machinery “imitates” immunomodulatory elements in the vertebrate defense system. But recent findings have highlighted a vast and rapidly expanding array of other virologs that include many genes not typically found in viruses, such as those involved in translation, central carbon metabolism, cytoskeletal structure, nutrient transport, vesicular trafficking, and light harvesting. Unraveling the roles of virologs during infection as well as the evolutionary pathways through which complex functional repertoires are acquired by viruses are important frontiers at the forefront of giant virus research.

     
    more » « less
    Free, publicly-accessible full text available September 1, 2024
  4. Abstract

    Chlamydomonas reinhardtii is a unicellular eukaryotic alga that has been studied as a model organism for decades. Despite an extensive history as a model system, phylogenetic and genetic characteristics of viruses infecting this alga have remained elusive. We analyzed high-throughput genome sequence data of C. reinhardtii field isolates, and in six we discovered sequences belonging to endogenous giant viruses that reach up to several 100 kb in length. In addition, we have also discovered the entire genome of a closely related giant virus that is endogenized within the genome of Chlamydomonas incerta, the closest sequenced relative of C. reinhardtii. Endogenous giant viruses add hundreds of new gene families to the host strains, highlighting their contribution to the pangenome dynamics and interstrain genomic variability of C. reinhardtii. Our findings suggest that the endogenization of giant viruses may have important implications for structuring the population dynamics and ecology of protists in the environment.

     
    more » « less
  5. The gut of the European honey bee (Apis mellifera)possesses a relatively simple bacterial community, but little is known about its community of prophages (temperate bacteriophages integrated into the bacterial genome). Although prophages may eventually begin replicating and kill their bacterial hosts, they can also sometimes be beneficial for their hosts by conferring protection from other phage infections or encoding genes in metabolic pathways and for toxins. In this study, we explored prophages in 17 species of core bacteria in the honey bee gut and two honey bee pathogens. Out of the 181 genomes examined, 431 putative prophage regions were predicted. Among core gut bacteria, the number of prophages per genome ranged from zero to seven and prophage composition (the compositional percentage of each bacterial genome attributable to prophages) ranged from 0 to 7%.Snodgrassella alviandGilliamella apicolahad the highest median prophages per genome (3.0 ± 1.46; 3.0 ± 1.59), as well as the highest prophage composition (2.58% ± 1.4; 3.0% ± 1.59). The pathogenPaenibacillus larvaehad a higher median number of prophages (8.0 ± 5.33) and prophage composition (6.40% ± 3.08) than the pathogenMelissococcus plutoniusor any of the core bacteria. Prophage populations were highly specific to their bacterial host species, suggesting most prophages were acquired recently relative to the divergence of these bacterial groups. Furthermore, functional annotation of the predicted genes encoded within the prophage regions indicates that some prophages in the honey bee gut encode additional benefits to their bacterial hosts, such as genes in carbohydrate metabolism. Collectively, this survey suggests that prophages within the honey bee gut may contribute to the maintenance and stability of the honey bee gut microbiome and potentially modulate specific members of the bacterial community, particularlyS. alviandG. apicola.

     
    more » « less
  6. Although traditionally viewed as streamlined and simple, discoveries over the last century have revealed that viruses can exhibit surprisingly complex physical structures, genomic organization, ecological interactions, and evolutionary histories. Viruses can have physical dimensions and genome lengths that exceed many cellular lineages, and their infection strategies can involve a remarkable level of physiological remodeling of their host cells. Virus–virus communication and widespread forms of hyperparasitism have been shown to be common in the virosphere, demonstrating that dynamic ecological interactions often shape their success. And the evolutionary histories of viruses are often fraught with complexities, with chimeric genomes including genes derived from numerous distinct sources or evolved de novo. Here we will discuss many aspects of this viral complexity, with particular emphasis on large DNA viruses, and provide an outlook for future research. 
    more » « less
  7. Abstract Recent research has underscored the immense diversity and key biogeochemical roles of large DNA viruses in the ocean. Although they are important constituents of marine ecosystems, it is sometimes difficult to detect these viruses due to their large size and complex genomes. This is true for “jumbo” bacteriophages, which have genome sizes >200 kbp and large capsids reaching up to 0.45 µm in diameter. In this study, we sought to assess the genomic diversity and distribution of these bacteriophages in the ocean by generating and analyzing jumbo phage genomes from metagenomes. We recover 85 marine jumbo phages that ranged in size from 201 to 498 kilobases, and we examine their genetic similarities and biogeography together with a reference database of marine jumbo phage genomes. By analyzing Tara Oceans metagenomic data, we show that although most jumbo phages can be detected in a range of different size fractions, 17 of our bins tend to be found in those greater than 0.22 µm, potentially due to their large size. Our network-based analysis of gene-sharing patterns reveals that jumbo bacteriophages belong to five genome clusters that are typified by diverse replication strategies, genomic repertoires, and potential host ranges. Our analysis of jumbo phage distributions in the ocean reveals that depth is a major factor shaping their biogeography, with some phage genome clusters occurring preferentially in either surface or mesopelagic waters, respectively. Taken together, our findings indicate that jumbo phages are widespread community members in the ocean with complex genomic repertoires and ecological impacts that warrant further targeted investigation. 
    more » « less
  8. Casadesús, Josep (Ed.)
    The evolutionary forces that determine genome size in bacteria and archaea have been the subject of intense debate over the last few decades. Although the preferential loss of genes observed in prokaryotes is explained through the deletional bias, factors promoting and preventing the fixation of such gene losses often remain unclear. Importantly, statistical analyses on this topic typically do not consider the potential bias introduced by the shared ancestry of many lineages, which is critical when using species as data points because of the potential dependence on residuals. In this study, we investigated the genome size distributions across a broad diversity of bacteria and archaea to evaluate if this trait is phylogenetically conserved at broad phylogenetic scales. After model fit, Pagel’s lambda indicated a strong phylogenetic signal in genome size data, suggesting that the diversification of this trait is influenced by shared evolutionary histories. We used a phylogenetic generalized least-squares analysis (PGLS) to test whether phylogeny influences the predictability of genome size from dN/dS ratios and 16S copy number, two variables that have been previously linked to genome size. These results confirm that failure to account for evolutionary history can lead to biased interpretations of genome size predictors. Overall, our results indicate that although bacteria and archaea can rapidly gain and lose genetic material through gene transfers and deletions, respectively, phylogenetic signal for genome size distributions can still be recovered at broad phylogenetic scales that should be taken into account when inferring the drivers of genome size evolution. 
    more » « less
  9. Abstract

    Land use change has long-term effects on the structure of soil microbial communities, but the specific community assembly processes underlying these effects have not been identified. To investigate effects of historical land use on microbial community assembly, we sampled soils from several currently forested watersheds representing different historical land management regimes (e.g., undisturbed reference, logged, converted to agriculture). We characterized bacterial and fungal communities using amplicon sequencing and used a null model approach to quantify the relative importance of selection, dispersal, and drift processes on bacterial and fungal community assembly. We found that bacterial communities were structured by both selection and neutral (i.e., dispersal and drift) processes, while fungal communities were structured primarily by neutral processes. For both bacterial and fungal communities, selection was more important in historically disturbed soils compared with adjacent undisturbed sites, while dispersal processes were more important in undisturbed soils. Variation partitioning identified the drivers of selection to be changes in vegetation communities and soil properties (i.e., soil N availability) that occur following forest disturbance. Overall, this study casts new light on the effects of historical land use on soil microbial communities by identifying specific environmental factors that drive changes in community assembly.

     
    more » « less
  10. null (Ed.)
    The family Asfarviridae is a group of nucleo-cytoplasmic large DNA viruses (NCLDVs) of which African swine fever virus (ASFV) is well-characterized. Recently the discovery of several Asfarviridae members other than ASFV has suggested that this family represents a diverse and cosmopolitan group of viruses, but the genomics and distribution of this family have not been studied in detail. To this end we analyzed five complete genomes and 35 metagenome-assembled genomes (MAGs) of viruses from this family to shed light on their evolutionary relationships and environmental distribution. The Asfarvirus MAGs derive from diverse marine, freshwater, and terrestrial habitats, underscoring the broad environmental distribution of this family. We present phylogenetic analyses using conserved marker genes and whole-genome comparison of pairwise average amino acid identity (AAI) values, revealing a high level of genomic divergence across disparate Asfarviruses. Further, we found that Asfarviridae genomes encode genes with diverse predicted metabolic roles and detectable sequence homology to proteins in bacteria, archaea, and eukaryotes, highlighting the genomic chimerism that is a salient feature of NCLDV. Our read mapping from Tara oceans metagenomic data also revealed that three Asfarviridae MAGs were present in multiple marine samples, indicating that they are widespread in the ocean. In one of these MAGs we identified four marker genes with > 95% AAI to genes sequenced from a virus that infects the dinoflagellate Heterocapsa circularisquama (HcDNAV). This suggests a potential host for this MAG, which would thereby represent a reference genome of a dinoflagellate-infecting giant virus. Together, these results show that Asfarviridae are ubiquitous, comprise similar sequence divergence as other NCLDV families, and include several members that are widespread in the ocean and potentially infect ecologically important protists. 
    more » « less